Convergence of macrostates under reproducible processes 



Jochen Rau 



Institut fiir Theoretische Physik, Johann Wolfgang Goethe-Universitdt, Max-von-Laue-Str. 1, 60438 Frankfurt am Main, 



I show that whenever a system undergoes a reproducible macroscopic process the mutual distin- 
guishability of macrostates, as measured by their relative entropy, diminishes. This extends the 
second law which regards only ordinary entropies, and hence only the distinguishability between 
macrostates and one specific reference state (equidistribution). The new result holds regardless 
of whether the process is linear or nonlinear. Its proof hinges on the monotonicity of quan- 
tum relative entropy under arbitrary coarse grainings, even those that cannot be represented by 
completely positive maps. 
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1. Motivation 

As discussed lucidly by Jaynes fP] the second law of thermodynamics follows directly from 
the requirement that macroscopic processes be reproducible. The microscopic details of a macro- 
scopic system are generally beyond the reach of an experimenter; all she can observe and control 
is some limited set of macroscopic variables, typically pertaining to the constants of the motion 
or, if out of equilibrium, slowly varying degrees of freedom |2]. The system is modelled by a 
(generalized) canonical distribution 



where {Ga] are the observables whose expectation values characterize the system's macrostate. 
With properly adjusted Lagrange parameters {A") this canonical state encodes information about 
the "relevant" expectation values [ga = (Ga)^], while discarding (by maximising entropy) all 
other information. The preparation of the system amounts to setting the initial values of the rele- 
vant expectation values or of their conjugate Lagrange parameters. The system then undergoes a 
macroscopic process, at the end of which its macrostate is again characterized by the expectation 
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values pertaining to some small (possibly different) set of observables {Fh). The process is re- 
producible if the preparation uniquely determines the outcome; i.e., if merely on the basis of the 
initial {gg] one can predict the final {ft,}. A prediction cannot possibly contain more information 
than the data on which it is based. So the final expectation values cannot carry more information 
than do the initial expectation values; and hence indeed, the corresponding final entropy cannot 
be smaller than the initial entropy. 

The above argument translates into a simple mathematical proof of the second law as fol- 
lows. By assumption the initial state is modelled by ju^. On the microscopic level (and possibly 
enlarging the system to include any environment it might be coupled to) every process is unitary, 
fig —> UfigU^, preserving entropy: S[Ufj.gU^] - S\jig]. Again by assumption, the final state's 
macroscopic features can be described just as well by the canonical distribution fif, where the 
relevant expectation values must be consistent with the underlying unitary dynamics: 

fb - {Fb}uiigU^- (2) 

The replacement UfigU^ ~^ t^f constitutes a coarse graining that discards information about all 
degrees of freedom but the relevant expectation values; it can thus only increase the entropy, 
S[Mf] >S[UiUgU'^]. Hence S[fif] >S[iUg], Q.E.D. 

Rather than in terms of ordinary entropy the second law can also be formulated in terms of 
relative entropy. The relative entropy between two states p and cr, 

5(p||cr):=tr(plnp-plncr), (3) 

n 

IS a measure for their distinguishability [3]. A precise operational interpretation is furnished by 
the quantum Stein lemma |4-6]: The probability that, given a sample of size taken from an 
i.i.d. source of states cr, tomography will erroneously reveal the state p decreases exponentially 
with N as N —> oo, 

prob,(p^|cr^) ~ exp[-NS (p\\cr)] (4) 
regardless of the threshold e (0 < e < 1), where 

prob,(p^|c/') := ^^inf ^ {((r^|r)| (p^lD > e) , (5) 

{X\Y) :- tr(X^Y) and, e.g., p^ is short for p®'^. Loosely speaking, therefore, the relevant entropy 
drives the probability of mistaking cr for p. With this definition of distinguishability the statement 
of the second law - that any reproducible process increases entropy - is now equivalent to saying 
that under a reproducible process macrostates become less distinguishable from the uniform 
distribution: 

5(///||l/ti-l)<50uj||l/trl). (6) 

As the second law reflects the tendency of macrostates to move closer to equidistribution, 
one would intuitively expect that they have a tendency to converge, and that hence not only 
the distinguishability between any macrostate and the uniform distribution (which is one par- 
ticular macrostate) diminishes but also the mutual distinguishability between arbitrary pairs of 
macrostates. Such would mean that for any two initial macrostates fig and jj.g> evolving under the 
same reproducible process to final macrostates jj.f and fif, respectively, the relative entropy can 
only decrease: 

S(fif\\f,f,)<S(fig\\fig,). (7) 
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This assertion is not implied by the second law; rather, if true, it would constitute a nontrivial 
extension. In fact, it is known to be true for macroscopic processes that are linear: A linear 
process is represented by a completely positive map <b for which by Lindblad's theorem ^ 

S(<i(p)mcr))<S(p\\cr) (8) 

for arbitrary p and cr. Yet not all macroscopic processes are linear — many real processes such 
as, say, the dynamics of a fluid are governed by transport equations that are nonlinear. For this 
more general situation I am not aware of a proof of the inequality (|7]). Closing this gap is the 
purpose of my Letter. 



2. Derivation 

My line of argument is inspired by the above simple proof of the second law. Like the 
ordinary entropy, relative entropy is invariant under unitary microscopic evolution, so 

S(U^igU^'\\Ufig,U'') = SifigW^ig,). (9) 

By assumption, for the purposes of a macroscopic description the transformed states on the left 
hand side may be replaced by and fiff, respectively, where the coarse grainings f//igf/' ~* A*/ 
and Ufig'U^ — » fif preserve the respective expectation values of the relevant observables {Fh}. 
Then a la Jaynes the inequality (|7]) follows immediately if only one can show 

siMfWiif) < s(UfigU-H\Ufig,u'^y, (10) 

or more generally, 

S(Mf(p)\Wf«r))<S(p\\cr) (11) 

with { fh(p) = {Fh)p] and {/i(cr) = {Fh}^]- Such monotonicity appears plausible because one 
intuitively expects coarse graining to diminish the distinguishability of quantum states; and in- 
deed, it is known to be true (again by Lindblad's theorem) whenever the coarse graining can be 
represented by a completely positive map. Yet again, not all coarse grainings are of this form. 
For example, if the system under consideration is bipartite (A x B) and the relevant observables 
are all those pertaining to either A or B then coarse graining amounts to removing correlations, 
pAB pA x Pb, which is not a linear map. The remainder of this Section shall be devoted to 
showing that monotonicity holds in these nonlinear cases, too. 

The proof will make use of the existence and properties of the so-called Kawasaki-Gunton 
projector, a mathematical object originally introduced and used in the context of nonlinear trans- 
port equations 12,111] . A special case of such a Kawasaki-Gunton projector is the (super- )projector 
!P^j which acts on the space of observables such that for arbitrary positive F (0 < F < 1) 

(p'^llP^^jF) = ifi'J.^JD. (12) 

This projector has a nontrivial form that depends on and on the relevant expectation values. 
Its main properties of interest here are that (i) it is always linear, even when the corresponding 
coarse graining in state space is not; (ii) it preserves positivity, so {PF] c {F}; and (iii) it is 
idempotent in the sense 

^f(prf(a-) ~ ^M) ^'^^J 
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even for p + cr. 

By the quantum Stein lemma, proving the monotonicity (fTTI) is tantamount to showing that, 
asymptotically, 

prob,,^^)|;uf(^)) > prob,(p^|cr^) (14) 

for conveniently chosen e' and e. With the definition Q and the complement (3 := 1 - 'P of the 
Kawasaki-Gunton projector the left-hand side of this inequality can be written as 

P™b.K(,)K(.)) = mf , {(-"l^^:)r)| ip'WZ,^) > e - (/.^^,ie^:,r)} . (15) 

Whenever the coarse graining is represented by a completely positive map the Kawasaki-Gunton 
projector becomes independent of the expectation values {//,}. The term (yulQF) then vanishes, 
and for the remainder the infimum can be taken just as well over {iPF} rather than {F). As 
infpr ^ infr this implies immediately the monotonicity (fT4l l. As soon as the coarse graining is 
nonlinear, however, further analysis of the term (yu|6}F) is needed, to which 1 turn next. 
For any state p and any canonical state yUy 1 define 

r(p,ju/) := sup sup |(p^|Qf 'F)|. (16) 

N o<r<i 

This quantity lies in the range [0, 1]; it vanishes at p = /j./; and it can attain the maximum value 
y - 1, if at all, only for p that have at least one zero eigenvalue. If p is replaced by fif(p) then the 
latter can only be the case if one of the associated expectation values {ftip)} (which are different 
from the {//,)) takes an extremal value; so as long as one excludes extremal expectation values, 
y(///(p),yU/) is always strictly smaller than one: 

0<y(Mf(p>,tif)<l V fi,ip)e{fr\fr)- (17) 
Consequently, the choices 

e' = (l+y(///,p),A(/(.)))/2 (18) 
e^(l-y(^lf(p),^lf(a)))/2 (19) 
both lie in (0, 1) and ensure that for any and F 

^'-0"/(p)l4(<^)r)>e. (20) 

Thus from Eq. (flSl l. 

p^-K^(MlK<r? ^ oM, {(-"iCnl (p'lrZp ^ ^} ■ (21) 

On the right-hand side the infimum can now be taken just as well over {'PftlP rather than {F}; 
and as infpr ^ infr this implies the monotonicity (fT4b . Q.E.D. 



3. Conclusion 

Under a reproducible process macrostates have a tendency not only to move closer towards 
equidistribution (as stated by the second law) but also to move closer towards each other, as I 
have shown in this Letter. The proof depended crucially on the monotonicity of quantum relative 
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entropy under arbitrary (including nonlinear) coarse grainings. This generalized monotonicity 
is an interesting result in itself, with useful implications not only for statistical mechanics but 
also for quantum information theory. For instance, in the earlier example of coarse graining by 
removing correlations it implies 

S(paX PbWo-a X o-b) < 5 (pabWo-ab), (22) 
a much stronger result than the well-known monotonicity S (paWo'a) ^ S (pabWcab)- 
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